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For  estimating  the  mean  of  a p-variate  normal  distri- 
bution a family  of  estimators  known  as  Stein’s  estimators 
are  known  to  dominate  the  maximum  likelihood  estimator  and 
are  therefore  minimax  when  p>3,  under  quadratic  loss,  equal 
to  the  sum  of  squared  errors  of  the  components  of  the 
estimator.  It  is  shown  in  this  paper  for  a subfamily,  the 
vector  consisting  of  any  K components  of  the  estimator  is 
also  minimax  for  estimating  the  corresponding  components  of 
the  mean  vector,  where  S^K^p. 


1.  Int  roduct  ion . Let  the  p-component  vector  X ■ rXp...,Xp) 

I 

be  normally  distributed  with  mean  (>  » (dj  , . . . ,0^.)  and 
convariance  o'"!  where  p^S,  1 denotes  the  identity  matrix  and 
o is  known.  Tor  estimatinj:  d let  the  loss  be  equal  to  the 
sum  of  squared  errors,  ^iven  by 

P 

1.16, 91  = E (6.-9.)“ 
i = l ' * 

where  6(X)  = ( 6 j ( X)  , . . . , 6^1 X ) ) is  an  estimate  of  9,  based  on  X. 
The  maximum  likelihood  estimator  X is  minimax  but  inadmissible 
for  p^.^  with  respect  to  the  jjiven  loss  function.  The  inadmiss- 
ibility of  X was  first  proved  by  .Stein  (19.^5).  An  estimator 
which  dominates  X and  i.s  known  as  Stein's  estimator,  is  given 
by 

(1.1)  6*(X)  = (1-  ) X 

where  S = Z XT  and  0<v<2.  The  risk  of  6*  is  given  by 
i = l ^ 

(1.2)  R(6*,9)  = i;  L(6*,0) 

= po“(i-v(:-v)(p-:)-R(^)). 

< po“  = R(X,9) . 

We  are  interested  in  assessing  ihe  performance  of  6* 
component-wise.  Let 

(1.3)  R*  = R(6*-0.)^ 


r 


1 


denote  the  contribution  to  tl>c  risk  of  from  the  ith  com- 
ponent. It  is  shov\n  in  the  followinji  section  that  for  any 
integer  K,  such  that 


( 1.4) 


^ * ■> 
I R,  < Ko“ 


for  0<v<  above  inequality  shows  that  6*  is 

minimax  for  estimating  three  or  more  comjioncnts  of  0.  We 
also  generalize  the  above  result  for  the  case  in  which  the 
variances  of  the  components  of  X arc  different  but  known  or 
unknown  up  to  a constant  factor.  Tlie  result  is  extended  to 
another  class  of  estimators. 


2.  Main  results.  We  shall  compute  the  value  of  R^,  the 
contribution  to  the  risk  of  from  the  ith  component. 

First  we  consider  the  case  where  is  known.  We  let  o = 1, 
without  loss  of  generality.  Let  f^^  (,y)  denote  the  density 
function  of  a chi-square  random  variable  with  m degrees  of 


freedom  and 


g(x)  = expl-4tX-0)'  (X-9)) 


denote  the  normal  density  function.  For  any  integrable 


function  <(>,  we  have 


-07/2 


67/2 


(2.1)  f;  X <{.(X  X)  = 0 ^ ' /().(x' x)g(x)dx) 

1“  ti  ^ : 


» 03  f T*  - 1 

-B  0/2  „ (0  0/2J 


where  ■ ■■  denotes  a non-central  chi-square  random  variable 

in , \ ^ 

with  m dejirees  of  freedom  and  noncent  ra  1 i t>’  parameter  equal 


to  > . S i in  i 1 a I'  1 V 


where 


i:  x“vU’x)  = li  . er  i: 


r.et  = he  hue  ’ i J ^ j) 


Ma.b;>)  = 1 - {5  V.  . 'jy  ♦ ••• 


denotesthe  confluent  liype  rgeoiiiet  r ic  function.  l-rom  (1.5), 
(2.1)  and  (2.21  we  eot  by  direct  computation 


(2. .5)  R.  = (l-Xjm^,-  * p-(  , 1 .^411--.)  ♦ i| 

'p  + 2,X  P*4,X  '''P  + 2.\ 


= 1 + (p-2)  e'^/"l^(vi(4  -1.^-  > 1;  4)-2<>(5,?  > 1;  ^) ) 


^ 1.  ? * ^))|. 

is 

An  upper  bound  on  the  value  of  R.  is  obtained  below,  from  the 
iollowinj’  formula.s,  where  prime  denotes  derivative  with  respect 


to  y . 


a ('ll  (a  + 1 ,b  ;y) -<J>  (a  ,b  ;y ) ) = (a,b;y)  = <li  (a+ I ,b  + l ;y) 


(b-a)  i>  (a,b  ♦ l;y) 


bi>(a  ,b;y) -b-t  ia,b;y) 


Since  0<V'2;  we  have 


-1,^  ♦ l;,) 

< 2in5  '^'2  * 

= . ii  -?(i^  12  ♦ ’-A, 

The  quail  f i tv  insiile  the  braces  on  the  rijjht  side  of  12.5)  is 
by  the  yiven  formulas,  equal  to 


(:.5)  ♦ l;4)) 


-p 


_ 22 
i’  ( i> 


-.^Ty  lvlp-2)  + ■*■  ~ t'^)" 


From  (2.5),  (2.4)  and  (2.5)  we  have 


(2.b)  R.  1 ^ ^ 2;4)((v{p-2)  + 4)0“ 


Therefore,  for  5 < K < p 


(2.-)  R*  < K ^ Trf^T  "''^^"*^2’2  ^ 2;A)(v(p-2)  + 4-2K) 


< K = i;  R. 
i=l  ' 


"*  ( K - ’ ) 

for  V < — — * where  R.  denotes  the  contribution  to  the 

p - 2 1 

risk  of  X from  the  ith  component. 


X). 


1 

I 

5 

Usinji  in  (2.3)  the  asymptotic  property  of  the  confluent 
hypergeomet r ic  function,  given  by 

❖ (a,b;y)  - e' y ‘ ( 1 ♦O  (y  ' * ) ) , y>0 

we  have  that  for  largo  values  of  A 

(2.8)  » 1 + i!lr:i^((v,p-2)  ♦ a)  I • ^)  * tia*^). 

From  (2.8)  it  follows  that  v<2 (K- 2 ) / ( p- 2)  is  a necessary 
condition  for  the  inequality  (I.l)  to  hold  uniformly  in  0. 
Moreover,  it  is  known  that  X is  admissible  for  pjf^2.  Tbere* 
fore,  the  inequality  (2.7)  cannot  be  true  for  K<3.  Hence,  we 
have  shown  the  following  result. 

K * K 

Theorem  2.1.  The  inequality  E R.  < E R;  holds  for 

' i=l  ^ i=l  ^ 

all  0 if  and  only  if  S^K^p.  and  0<v‘'2  (K-2)/ (p-2)  . 

2 

Now  we  consider  the  case  when  o is  unknown  but  there 

d 2 2 

is  available  an  estimate  T,  say,  such  that  T ~ o inde- 

2 

pendent  of  X.  In  this  case,  substituting  T for  o in  (1.1), 
Stein's  estimator  is  given  by 

(2.9)  6««(X)  ■ (l-'’*l’g^)^)X 

. (1- 

A 

where  v = vT.  To  compare  6**  with  the  maximum  likelihood 


estimator  X we  let  o = 1 , as  before,  without  loss  of  generality. 


I 


I 

I 


I 

i 

I 
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Substituting  v for  v (.1.2)  and  taking  expectation  with  respect 
to  the  distribution  of  T we  find  that 

R(6**,e)  < R(X,e) 

A 

for  v<2m/ . Similarly,  we  substitute  v for  v in  (2.3) 
and  taking  expectation  with  respect  to  the  distribution  of  T, 
we  obtain  the  risk  of  6**  due  to  the  ith  component,  given  by 


(2.10)  R**  = 1 ^ (p-2)e'^^"[g(^^"**-“-^  ♦ (f  -l.f 

> l;^)) 


l^) 


-p^2  ♦ cf  * 1.  f > 2;4)}]. 

Let  v<2m/(m+2).  Applying  the  method  used  in  deriving  (2.6) 
we  get 


(2.11)  R**  • < 1 * Cp-2)e'*^^l^(»(E  -l.E  * * l;i)l 

■ ^ ^ * <!■?  * . 4 


-21]  . 

' 


From  (2.12)  we  obtain  the  following  theorem  extending 
the  result  of  Theorem  2.1.  The  necessary  part  of  the  theorem 
is  obtained  by  considering  the  asymptotic  property  of  the 
confluent  hypergeometric  function  as  in  the  proof  of  Theorem  2.1. 

K K 

Theorem  2.2.  The  inequality  E R.  < E R.  holds  for 

i*l  ^ i*l  * 

all  0 if  and  only  if  3£K^p  and  0<v<2m(K- 2) / (m'*-2)  (p- 2 ) . 

If  the  loss  function  is  suitably  modified  then  the  results 
given  above  hold  also  for  the  case  in  which  the  components  of 

X are  independent  but  have  unequal  variances,  as  shown  below, 
d 2 

Let  ~ N'(6^, ap,  and  let  the  loss  function  be  given  by 

(2.13)  L(6,e)  = E (6.-0p^/o^. 

i=l  ^ ^ ^ 

First  suppose  that  the  variances  are 

^22 

known.  Let  S = E X./o..  Consider  the  estimator 
i = l ^ ^ 

(2.14)  6 = (l-v(p-2)/S)X 


\ 


It  is  easy  to  see  that  the  conclusion  of  Theorem  2.1  is  valid 


8 


I'or  «5  with  respect  to  the  loss  jjiven  by  (2.13).  Next,  suppose 
that  the  variances  are  known  up  to  a constant  factor,  that  is, 
let  ot’  » aV.  where  a is  unknown  but  the  V.  * s are  known.  If  an 

X 1 I 

estimate  T is  available  for  a,  such  that,  T <1  a then 

the  conclusion  of  Theorem  2.2  is  valid  with  respect  to  the  loss 
(2.13)  for  the  estimator. 

A A 

(2.15)  6 - (l-v(p-2)T/S)X  . 

where  S * E?  .^T/V.  . 

On  the  other  hand,  if  the  loss  function  is  not  modified, 
that  is,  we  consider  the  original  loss  function  equal  to  the 
sum  of  squared  errors,  then  we  can  use  the  estimators  n and  n 

A A 

for  6 and  6,  respectively,  in  the  two  cases  considered  in  the 
preceding  paragraph,  where 

(2.16)  = (l-v(p-2)/oJs*)X. 

= (l-v(p-2)T/ors*)X. 

P 2,4 

where  S*  = Z ^ i • it  follows  from  Theorem  1 of  Berger 
i = l. 

(1976)  that  n dominates  the  maximum  likelihood  estimator  X. 

By  direct  computation  we  have  for  the  contribution  to  the  risk 

A 

of  n from  the  ith  component 


A 


E(n. *0.) 


2 


2Y^ 

2v  (p-  2 ) E i—j^2 


v^(p-2)^EYr/oV 


(2.18) 


X, 


•i  -j  P 2 2 

where  Y.  » — d N(-— ,1)  and  V ■ Z Y:/oT.  l-rom  (2.18)  we 


i 0 . ~ 'o . 
I 1 


i-1 


i 1 


have  for  3^K^p 


^ . K , 

(2.  19)if^  Rf  1 o-  ♦ 2v(p-2)E(^4  - ^)  ♦ v‘{p-2)^:^ 

K , K 

< i:  oT  ■ I R. 

i-1  ‘ i-1  ‘ 

for  0<v<  - . Similarly,  for  the  risk  of  n we  have 


(2.20) 


K s K 
E R;  < r Rj 
i-1  i-1  ^ 


for  0<v<  V • It  is  easily  shown  that  the  sufficient 

(m+2)(p-2) 

conditions  given  above  for  the  inequalities  (2.19)  and  (2.20) 
are  also  necessary.  Therefore,  the  conclusions  of  Theorem 
2.1  and  Theorem  2.2  hold  for  the  estimators  n and  n f respec- 
t ively . 

Remark:  A wide  class  of  estimators  for  the  mean  of  a 
multivariate  normal  distribution  is  known  (see  e.g.  Bock 
(1975))  to  dominate  the  maximum  likelihood  estimator.  It 
would  be  interesting  to  investigate  whether  some  of  these 
estimators  have  the  component -wide  minimax  property  of  Stein's 
estimator,  shown  above.  It  is  also  interesting  to  investigate 
similar  property  for  the  estimator  6^  , obtained  by  substi- 
tuting for  the  quant  i ty  ins ide  the  paranthesis  on  the  right 
side  of  (1.1)  by  its  positive  part,  tlie  estimator  6 is  known  to 
dominate  6*. 
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